cancer classification in microarray data using a hybrid selective independent component analysis (sica) and υ-support vector machine (υ-svm) algorithm
نویسندگان
چکیده
microarray data have an important role in identification and classification of the cancer tissues. having a few samples of microarrays in cancer researches is always one of the most concerns which lead to some problems in designing the classifiers. for this matter, preprocessing gene selection techniques should be utilized before classification to remove the noninformative genes from the microarray data. an appropriate gene selection method can significantly improve the performance of cancer classification. in this paper, we use selective independent component analysis (sica) for decreasing the dimension of microarray data. using this selective algorithm, we can solve the instability problem occurred in the case of employing conventional independent component analysis (ica) methods. first, the reconstruction error and selective set are analyzed as independent components of each gene, which have a small part in making error in order to reconstruct new sample. then, some of the modified support vector machine (υ‑svm) algorithm sub‑classifiers are trained, simultaneously. eventually, the best sub‑classifier with the highest recognition rate is selected. the proposed algorithm is applied on three cancer datasets (leukemia, breast cancer and lung cancer datasets), and its results are compared with other existing methods. the results illustrate that the proposed algorithm (sica + υ‑svm) has higher accuracy and validity in order to increase the classification accuracy. such that, our proposed algorithm exhibits relative improvements of 3.3% in correctness rate over ica + svm and svm algorithms in lung cancer dataset.
منابع مشابه
Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...
متن کاملModeling and design of a diagnostic and screening algorithm based on hybrid feature selection-enabled linear support vector machine classification
Background: In the current study, a hybrid feature selection approach involving filter and wrapper methods is applied to some bioscience databases with various records, attributes and classes; hence, this strategy enjoys the advantages of both methods such as fast execution, generality, and accuracy. The purpose is diagnosing of the disease status and estimating of the patient survival. Method...
متن کاملHeart Rate Variability Classification using Support Vector Machine and Genetic Algorithm
Background: Electrocardiogram (ECG) is defined as an electrical signal, which represents cardiac activity. Heart rate variability (HRV) as the variation of interval between two consecutive heartbeats represents the balance between the sympathetic and parasympathetic branches of the autonomic nervous system.Objective: In this study, we aimed to evaluate the efficiency of discrete wavelet transfo...
متن کاملSupport vector machine classification and validation of cancer tissue samples using microarray expression data
MOTIVATION DNA microarray experiments generating thousands of gene expression measurements, are being used to gather information from tissue and cell samples regarding gene expression differences that will be useful in diagnosing disease. We have developed a new method to analyse this kind of data using support vector machines (SVMs). This analysis consists of both classification of the tissue ...
متن کاملfeature selection and classification of microarray gene expression data of ovarian carcinoma patients using weighted voting support vector machine
we can reach by dna microarray gene expression to such wealth of information with thousands of variables (genes). analysis of this information can show genetic reasons of disease and tumor differences. in this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...
متن کاملHybrid independent component analysis and support vector machine learning scheme for face detection
In this paper we propose a new hybrid unsupervised / supervised learning scheme that integrates Independent Component Analysis (ICA) with the SupportVector Machine (SVM) approach and apply this new learning scheme to the face detection problem. In low-level feature extraction, ICA produces independent image bases that emphasize edge information in the image data. In high-level classification, S...
متن کاملمنابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
journal of medical signals and sensorsجلد ۴، شماره ۴، صفحات ۲۹۱-۰
میزبانی شده توسط پلتفرم ابری doprax.com
copyright © 2015-2023